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THE INHIBITION OF HYDROGENASE BY NITRIC OXIDE* 
By Atvin I. Krasnat ano D. Ritrenserct 
DEPARTMENT OF BIOCHEMISTRY, COLLEGE OF PHYSICIANS AND SURGEONS, COLUMBIA UNIVERSITY 


Communicated February 26, 1954 


The enzyme hydrogenase is of particular interest because of the comparative sim- 
plicity of its substrate. In its activation of molecular hydrogen, hydrogenase 
simulates the action of active platinum catalysts.' Although hydrogenase has not 
been obtained in pure form, considerable information is available concerning its 
properties. The evidence indicates that hydrogenase contains ferrous iron;® # 
that it forms with carbon monoxide an inactive complex,* * which is photochemically 
dissociated by light ;* > and that it can be oxidized to a ferric analogue, which can 
form a complex with cyanide.*: + Hydrogenase reacts with hydrogen according to 
the following equation, where / represents the enzyme :* 


E+ H: H:E- + Ht. (1) 


Fisher, Krasna, and Rittenberg’? have found that hydrogenase can be reversibly 


oxygenated and deoxygenated: 


In support of the view that hydrogenase is an iron-containing enzyme we now 
present evidence that it combines with and is inhibited by nitric oxide. Iron com- 
pounds, both inorganic and organic, are known to form complexes with nitric oxide.* 

Experimental.—Proteus vulgaris was grown, and cell-free extracts were prepared 
as described in a previous paper. The enzymatic activity was determined by 
measuring the rate of the exchange reaction :* © 


H, + HDO @ HD + H,0. 


The activity is expressed as the first-order rate constant, k. 

Tank nitric oxide was purified according to Farkas and Melville.° Oxygen- 
free hydrogen was prepared by passing tank hydrogen over hot copper. Nitric 
oxide-hydrogen mixtures containing from 0.1 to 10 per cent nitric oxide were pre- 
pared volumetrically. This procedure was not suitable for the preparation of very 
dilute mixtures, since traces of oxygen in the system would convert some or all of 
the nitric oxide to nitrogen dioxide. To prepare mixtures containing less than 0.1 
per cent nitric oxide, a mixture of 10 per cent NO and 90 per cent H2 was cooled to 
the temperature of liquid or solid nitrogen for 1 hour. At these temperatures all 
the nitrogen dioxide is frozen out, and the vapor pressure of solid nitric oxide deter- 
mines the final pressure of nitric oxide. These gas mixtures were kept in contact 
with the bacterial suspension for 10 minutes. The concentration of nitric oxide 
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in the mixtures of low concentration was determined by mass spectrometric 
analysis. 

Results.—The effect of different concentrations of nitric oxide on the hydrogenase 
activity of suspensions of Proteus vulgaris is shown in Table 1. Nitric oxide is a 
powerful inhibitor of hydrogenase. At low nitric oxide concentrations the inhibi- 
tion can be partly reversed by removal of the NO-H: mixture by evacuation and its 
replacement with H,. At high concentrations (1 per cent NO and above), the in- 
hibition could not be reversed even by the addition of sodium hydrosulfite, which 
reduces nitric oxide to nitrous oxide.’ The suspensions which remained inactive 
after removal of the NO did not catalyze the ortho-para hydrogen conversion.‘ 
The active suspension carried out this conversion with a rate constant, k, of 4.6 x 
10-* per minute. Nitric oxide also inhibited the hydrogenase activity of a cell-free 
extract. 

Nitric oxide is neither oxidized nor reduced by the bacterial suspension, since the 
composition of the NO-H, mixture over the bacteria does not change detectably 
in 24 hours. 


TABLE 1* 
INHIBITION OF HypROGENASE Activity BY Nitric OxIDE 


Nitric Ox1pE 


-——In PRESENCE oF NO—-~ Arrer RemMovat or NOt 
* Molarity Per Cent Per Cent 
Volume Per in Rate Constant Inhibi- Rate Constant Inhibi- 
Cent in Gas Solutiont k X 108 tion k X 108 tion 
0.002 4 X 107% 0.14 87 
0.008 L656 1077 0.08 93 0.71 35 
2 X 0.03 97 0.66 40 
| 2° 0.00 100 0.07 94 
2: 0.00 100 0.00 100 


e oe flask contained 0.1 ml. of a bacterial suspension (0.16 mg. N), in a total volume of 5 cc. of 10 per cent D:0 
uffered to 
+t The N 6 was removed by evacuation, and the flask was filled with hydrogen. 


t These values are calculated on the assumption that the solubility of NO in the buffer system is the same as in 
water. 


Discussion.—Hydrogenase appears to be sporadically distributed among the 
various species of microérganisms.'! It is likely that every organism which pro- 
duces or utilizes hydrogen contains hydrogenase. Incubation of Scenedesmus and 
certain other photosynthetic organisms under hydrogen results in the appearance of 
hydrogenase activity and concomitant photoreduction:'® '* 


Photoreduction is abolished by oxygen, and the change from normal photosynthesis 
to photoreduction is inhibited by cyanide and other agents which are known to form 
complexes with ferric iron. Since it is likely that photoreduction and photosyn- 
thesis are closely related processes, it seems reasonable that hydrogenase is derived 
from a part of the photosynthetic apparatus concerned with the splitting of water. 
There are but few biological substances which can combine reversibly with oxygen; 
all these contain iron or copper. Hydrogenase has been shown to combine revers- 
ibly with oxygen and to contain iron. This suggests that hydrogenase is closely 
related to the compounds which transport oxygen in the higher phyla. 
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We suggest that in the evolutionary process hydrogenase may have differentiated 
to yield the oxygen-producing system of the photosynthetic mechanism, the oxygen- 
transporting system, hemoglobin, and the enzymes, catalase, peroxidase, and the 
cytochromes. 

There is considerable evidence that the early atmosphere of the earth was rich 
in hydrogen and hydrocarbons and contained only traces of oxygen and carbon 
dioxide.'® In such an environment hydrogenase can be effective in the catalysis of 
reductive reactions which consume hydrogen and yield energy. Examples would 
be the reduction of CO, to CH, or of oxygen to water. Both reactions are exergonic. 

In the evolution of life, photoreduction probably preceded photosynthesis. 
The development of the photosynthetic mechanism, with the concomitant appear- 
ance of oxygen, inhibited hydrogenase and replaced the reducing reactions by oxi- 
dizing reactions as a source of energy. Hydrogenase is conceivably an evolutionary 
remnant. 

Summary.—Nitric oxide at low concentrations inhibits hydrogenase activity. 
The relation of hydrogenase to the other iron-containing enzymes and to hemo- 
globin is discussed. 


* This work was aided by a contract between the Office of Naval Research, Department of the 
Navy, and Columbia University (ON R-26602). The deuterium oxide was obtained on allocation 
of the Atomic Energy Commission. 

+ National Science Foundation Predoctoral Fellow, 1953-54. This report is from a dissertation 
to be submitted by Alvin I. Krasna in partial fulfilment of the requirement for the degree of 
Doctor of Philosophy in the Faculty of Pure Science, Columbia University. 

t Qn leave, 1953-54. Present address: Institute for Advanced Study, Princeton, N.J. 
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A BIOCHEMICAL AND PHARMACOLOGICAL SUGGESTION 
ABOUT CERTAIN MENTAL DISORDERS 


By D. W. Woo.uey AND E. SHaw 
ROCKEFELLER INSTITUTE FOR MEDICAL RESEARCH, NEW YORK 
Communicated February 16, 1954 


Recent findings in this laboratory" ? and elsewhere* * have permitted an under- 
standing of some aspects of mental diseases in relation to the hormone-like com- 
pound, serotonin. Furthermore, these findings lead directly to a suggestion for 
a logical treatment of diseases known as “schizophrenia.’”’ The experimental ob- 
servations have been made solely on laboratory animals, but they have reached a 
point where clinical trials in human psychiatric patients are required to test the 
validity of the conclusions. Being only biochemists, we are unable to do these 
experiments on patients and can only hope that this paper will stimulate those who 
are professionally qualified to undertake in man what we cannot pursue further 
in laboratory animals. 

This discussion is going to revolve about serotonin,® or enteramine, if Erspamer’s 
terminology is followed.* Serotonin is one of the latest hormone-like substances 
to be discovered. Chemically it has been shown to have the structure given in 
Figure 1.7. Being a simple molecule, it has been synthetically produced without 
great difficulty® ° and is thus readily available. It was discovered because it is 
the vasoconstrictor long known to form in the serum when blood clots. Erspamer’s 
work® with it, done independently of that of Rapport, et al.,> on the vasoconstrictor 
material, was based on the abundant occurrence of this new compound in the entero- 
chromaffinic cells of the gastric and intestinal mucosa. Serotonin has now been 
isolated from several different organs, including the brain, of a wide variety of 
animal forms.'°-!2_ There can be no doubt of its wide-spread occurrence in living 
things. Furthermore, a variety of pharmacological properties in isolated organs 
and tissues has been demonstrated, in addition to the vasoconstrictor effect.® 
The pharmacological properties which have been described thus far are attributable 
in large part to the ability of serotonin to cause various smooth muscles to contract; 
but there are indications that it has other effects which may become more clearly 
defined as time for study of them goes by. 

Aside from the occurrence and importance of serotonin, a second general idea 
is prerequisite to this discussion. It is now well known that several classes of 
drugs are related chemically to individual hormones and other essential metabo- 
lites. In fact, a major part of the pharmacological effects of these drugs is 
attributed to a specific interference with the biological functioning of these metabo- 
lites to which the drugs are related structurally. That is, the drugs are antimetab- 
olites. 

Our thinking and experimentation about certain mental disorders has been in 
the following vein. Several synthetic compounds have been produced which are 
very closely related in structure to serotonin. These were shown to antagonize, 
in a competitive fashion, the contractions of artery walls caused by serotonin.’* ” 
Also, the fact that the ergot alkaloids are structurally related to serotonin was ap- 
preciated, and it was demonstrated that several of these actually did antagonize 
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the action of serotonin on artery walls.''? A derivative of the ergot alkaloids, 
namely, lysergic acid diethylamide (cf. Fig. 1), was then shown by Gaddum, e¢ al.,* 
to act as an antimetabolite of serotonin on smooth muscles. It is well known that 
this amide calls forth in man mental disturbances resembling those of schizo- 
phrenia.’ From these experiences we therefore conceived the idea that the mental 
disturbances caused by lysergic acid diethylamide were to be attributed to an inter- 
ference with the action of serotonin in the brain. The failure of ergotamine or of 
ergotoxin to produce similar mental aberrations was postulated to be due to the 
failure of these more complex derivatives of lysergic acid to penetrate into the 
brain. Gaddum also was cognizant of the mental effects of lysergic acid diethyl- 
amide and of the occurrence of serotonin in the brain. We have surmised that he 
has been thinking, just as we have, about the relationship of serotonin to the mental 
disturbances induced by the drug. 


Even before the study of the ergot R 
alkaloids as antimetabolites ‘of serotonin, 
another type of alkaloid, viz., yohimbine, y uae 
was recognized to be structurally similar 
Serotonin | 
to serotonin and was actually demon- 
strated to function as an antimetabolite i 
of it in the artery-wall test. * Then two Ergot alkaloid 
When R is diethylamino-, 
members of yet a third class of alkaloid— | acid diettylamide; 
ar alk ids—w When a peptide, 
the harmala alkaloids—were seen to be i & ~Ergotamine, Ergotaxine, etc) 
structural analogues of serotonin and 
Harmine 
were actually shown to antagonize its 
action on artery walls.? CHs 
Although the chemical structures of the 
3 

ergot alkaloids, yohimbine, and the har- 
mala alkaloids are not very similar, all 
three classes of drugs have in common a Medmain Yonimpine ! 
resemblance to serotonin (Fig. 1). In Fig. 1. 


each case the hormone has been modified 
in a different way in order to arrive at the particular class of alkaloid. Further- 
more, they have all been shown to function as antagonists to the hormone. 

The important point for the present discussion is that they all have something 
elseincommon. This is the ability of at least some members of each group to cause 
mental disturbances. If one does no more than read a textbook of pharmacology, 
one is struck by this fact.!* Witness the production of schizophrenia with simple 
amides of lysergic acid, the hallucinations and euphoria called forth by some of 
the harmala alkaloids, and the mental disturbances attributed to the use of yohim- 
bine. Perhaps even the aboriginal use of crude yohimbine as an aphrodisiac is 
further small evidence. 

Bearing in mind the mental disturbances caused by alkaloids known to be antag- 
onists of serotonin, let us examine other types of compounds which are not alkaloids 
but merely synthetic serotonin antagonists. What do we find? The highly active 
synthetic antimetabolite of serotonin, medmain (cf. Fig. 1),2° when injected into 
mice, calls forth convulsions much resembling those of human epilepsy. Hydra- 
zino-phthalazine, which is said by Taylor, Page, and Corcoran‘ to antagonize some 
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of the biological effects of serotonin, has been found to produce in man mental aber- 
rations somewhat similar to those of schizophrenia.”! 

The thesis of this paper is that these pharmacological findings indicate that ser- 
otonin has an important role to play in mental processes and that the suppression — 
of its action results in a mental disorder. In other words, it is the lack of serotonin 
which is the cause of the disorder. If now a deficiency of serotonin in the central 
nervous system were to result from metabolic rather than from pharmacologically 
induced disturbances, these same mental aberrations would be expected to become 
manifest. Perhaps such a deficiency is responsible for the natural occurrence of 
the diseases. 

Of course, this may be the wrong inference from the pharmacological findings. 
It must be recognized that the classes of alkaloids and synthetic drugs we have been 
discussing are the very ones which, at one time or another, have been used for the 
reduction of high blood pressure. The mental disturbances may be no more than 
the reflection of diminished blood flow which they may cause in the brain. To 
this objection there is as vet no adequate answer, and it may be that it is the true 
explanation. 

If the hypothesis about serotonin deficiency is accepted, then the obvious thing 
to do is to treat patients having appropriate mental disorders with serotonin. — It 
is here that one must be a clinician rather than a biochemist; for how is one to in- 
duce, or even more to diagnose, schizophrenia in laboratory animals? 

One may ask whether the administration of serotonin to humans is a safe pro- 
cedure. More information is certainly needed on this point, but the few reports in 
the literature, where it has been done, as well as the toxicological studies in animals 
indicate no great risk.??~*4 

If serotonin is to be given to selected mental patients, it will probably have to 
be injected intramuscularly or subcutaneously. What assurance is there that it 
will find its way to the brain, even though it were to escape destruction by amine 
oxidase and possibly other enzymes in the periphery? The answer is that we have 
no assurance that it will reach the central nervous system or persist there. Most 
experience in experimental animals would, in fact, indicate that serotonin is rapidly 
destroyed, especially when given by the intravenous route. We have made estima- 
tions (by chemical assay) of the amount of serotonin found in the brain of a mouse 
10 minutes after the intraperitoneal injection of 5 mg. of it, and have found less 
than 0.01 mg. Nevertheless, because mice are not necessarily the same as men in 
this respect, the clinical trial of serotonin in mental disorders may still be worth 
while. 

If destruction or failure to penetrate the central nervous system should render 
abortive attempts to treat schizophrenia with intramuscular serotonin, then the 
idea should not be abandoned until the biochemists have made one more effort. 
This is so to alter the structure of the hormone that it will remain long-acting in the 
body and will penetrate into the brain from the peripheral circulation. This may 
not be an impossible task. There are already several indications that such desira- 
ble properties can be built into metabolites such as serotonin. 

In summary, the suggestions we wish to make are the following: (1) serotonin 
probably plays a role in maintaining normal mental processes; (2) metabolically 
induced deficiency of serotonin may contribute to the production of some mental 
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disorders; (3) serotonin or a long-acting derivative of it may prove capable of al- 
leviating disorders similar to schizophrenia. 
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DYNAMIC PROGRAMMING AND A NEW FORMALISM IN THE 
CALCULUS OF VARIATIONS 


By RicHarp BELLMAN 
Communicated by John von Neumann, February 9, 1954 


1. Jntroduction.—In a series of papers,'!~!! we have treated a number of mathe- 
matical problems arising from multistage decision processes. Problems of this type 
occur in the theory of probability;' * in mathematical economics; 7 1 
in control processes ;2 * in learning processes ;* ° and in many other fields as well. 

In this paper we wish to show that the functional-equation technique introduced 
in the above works may be used to provide a new approach to some classical prob- 
lems in the calculus of variations. In addition to furnishing a new analytic weapon, 
we feel that the method has great potentialities as a computational tool. As we 
have pointed out previously,!: * ‘ this approach seems ideally suited to the handling 
of variational problems involving stochastic processes. This point will be further 
enlarged upon in some forthcoming publications. 

. To illustrate the approach in its simplest setting, we shall consider first the prob- 
lem of maximizing fo! F(x, z) du, subject to the constraint dx/du = G(a, z), x(0) = 


4 
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c, where the analytic detail is at a minimum, and then turn to the eigenvalue prob- 
lem derived from 


= 0, x) = x(1) = 0, (1.1) 


which is rather less straightforward. 

In both cases we shall derive a partial differential equation for the desired quan- 
tity. The method we sketch below is also applicable to multidimensional eigen- 
value problems whenever the underlying space has certain symmetry properties. 
Further results, together with a detailed exposition of the results contained herein, 
will appear in another publication. 

2. The Maximization Problem.—Let us now consider the maximization problem 
described in the first section. We shall consistently assume that all functions that 
appear possess all the differentiability properties required to make our operations 
legitimate, since we are primarily interested here in presenting the basic formalism. 
The essence of our method lies in considering the maximum value as a function of 
the parameters which describe the state of the process, the “state variables.” In 
this case these are c and ¢. We write 


Max F(a, z) du = t). (2.1) 


The classical approach regards ¢ and ¢ as fixed and considers fo’ F(x, z) du as a 
functional of z. In contrast, we regard c and ¢ as parameters and seek to determine 
2(0) as a function of ¢ and ¢. 

Speaking in terms of a multistage process, in place of determining the optimal 
continuation from one fixed position, we try to find the optimal first step from any 
position. 

Let us proceed to the analysis which will clarify the above general remarks. We 
have 


f(e,s + 0) = Max fo’ t' F(a, z) du = 
Max [Jfo* F(a, 2) du + *' F(a, 2) duj. (2.2) 


It is clear that whatever the choice of z(w) inO < u < s, if we wish to maximize 
the right side of (2.2) we must choose z(w) ins < u < s + ¢ so as to maximize 
Sv + F(x, 2) du subject to the constraints dx/dt = G(x, z),.x(s) = e(s). Here e(s) 
is the new initial value obtained from the differential equation dx/dt = G(x, 2), 
x(0) = c. 

In view of the invariant nature of the problem with respect to translations, we 
see that we must have 


F(a, 2) du = fle(s), t). (2.3) 
Hence (2.2) yields the fundamental functional equation 


f(c, + = Max { fo* F(x, z) du + ff{e(s), (2.4) 
z [0, s] 


The notation 2z[0, s] indicates that the maximization is over all z(w) taken over 
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0 <u<s. Assshrinks to 0, the choice of a function reduces to a choice of 2(0), a 
quantity which we shall call w for typographical convenience. A straightforward 
calculation yields 


) = Max [F(e, w) + Ge, w)f,] (2.5) 


as the limiting form of (2.4) ass—0. The maximizing w is obtained by equating 


\- the partial derivative with respect to w to zero, 0 = F,, + Gyf,.!2 Combining this 
\- with (2.5), we obtain the system of equations 

= — GF y)/Gy = Kc, wv), 

fe = —Fo/Go = L(c, w). (2.6) 
¥ To derive a partial differential equation for w, we employ the identity (f)). = (fede. 
“ The result is 

18 

a. K. + Kyw, = (2.7) 
of 

m As is well known,'* the general solution of this equation may be obtained from the 


general solution of a system of first-order ordinary differential equations. Since 
cis known for ¢ = 0, to complete the solution we require only the value of w at ¢ = 0. 
1) For small ¢ we have 


So’ F(x, z) du = Fle, w)t + O(#). (2.8) 
- Hence w at ¢ = 0 is determined by F,, = 0. 

Taking G(x, z) = 2, we are reduced to the familiar problem of maximizing 
al So’ F(x, x’) du. It is readily verified that the characteristic equations obtained 
y from (2.7) are equivalent to the usual Euler equation. 

The general problem of maximizing F(21, 22, . Uni 21) 5 2m)du sub- 
re ject to the constraints dx;/dt = G,(x, 2), 2,(0) = c; may be attacked in the same 
fashion. 

3. The Eigenvalue Problem.—The eigenvalue problem posed in §1 is equivalent 
to that of determining the successive minima of fj! x’? du subject to the constraints 
2) So! ox? du, x(0) = x(1) = 0. To treat the problem using the functional-equation 

approach, we imbed it within the more general problem of determining the minima 
ue of = du subject to the constraints 
ze 
8) (a) = x(a+t) = 0, 
(+) o(watdu tk ou) (att—wedu=1. (3.1) 
ve Let Min J(u) = f(a, k, t). Then, if, as above, we write 

fla = fot dut du, (3.2) 

3) 

we cannot derive a functional equation immediately, since 2(a + s) is not neces- 

sarily zero. To simplify the analysis, let us pass directly to the derivation of the 
1) partial differential equation, assuming that s is an infinitesimal. It is sufficient to 


consider only terms which are of zeroth or first order in s. Since «(a + s) = 
st’(a) +... , we set y(u) = x(u) — sx’(a) (a+s+t—u)/t. Then y(a + s) = 


w 
ae 
er 
% 
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y(a + s+ t) = 0, to the order of our approximation, and x’(u) = y’(u) — sx’(a)/t. 
Thus, to terms in s?, 


9 t 


The constraint of (3.16) becomes 


du + E +s +t — u) du = 
1 — sy[k, x’(a), t] + O(s*), (3.3) 

where 
_ kx'(a) 


St (a+ t — du. (3.4) 


x’(a), 


Making a change of variable y = (1 — (sp/2))w, to reduce the constant term in 
(3.3) to 1, the constraint takes the form 

ky 22’(a) 


2 


du + +s | 


+! (a + s +t — w) du = 1 + O(s?). (3.5) 


The equation in (3.2) becomes 


f(a,k,s +t) = min ) Sit dut 


x’la, 


22’ 
+sk+s ‘|| + 0(s?). (3.6) 
Letting s ~ 0, this yields the partial differential equation 
x’(a) “ ) 


Since y = x’(a)® is independent of x’(a), the unique minimum occurs when 
kb 2 
2x'(a) + + t — Pf = 0. (3.8) 


Eliminating x’(a) by means of this last relation, we obtain a nonlinear partial differ- 
ential equation for f. 
The behavior of f for small ¢ may be determined by solving the problem of mini- 
mizing J," * ‘x’? du subject to the constraints 
x(a) = x(a + t) = O, (3.9) 
for dutkh t' ou) (a+ t — wa du = 1. 


This problem, in turn, may be treated by the partial differential equation approach, 
using the fact that the solution for k = 0 is immediate. 


'R. Bellman, ‘On the Theory of Dynamic Programming,” Proc. Natu. Acap. Sct., 38, 716- 
719 (1952). 
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2. Bellman, “On Bottleneck Problems and Dynamic Programming,” ibid., 39, 947-951 
(1953). 

3. Bellman, “On Computational Problems in the Theory of Dynamic Programming,”’ Sym- 
posium on Applied Mathematics, Santa Monica, 1953. 

4 R. Bellman, ‘Some Functional Equations in the Theory of Dynamic Programming,” Proc, 
Natt. Acap. Sct. (to appear). 

> R. Bellman, “On Limiting Theorems for Non-commutative Processes,” Duke Math. J. (to 
appear ). 

6 R. Bellman, “A Problem in the Theory of Dynamic Programming,” Econometrica (to appear). 

7 R. Bellman, ‘“‘An Optimal Inventory Equation,” sub. to Pac. J. Math. 

8 R. Bellman, I. Glicksberg, and O. Gross, “On Some Variational Problems in the Theory of 
Dynamic Programming,” Proc. Natu. Acap. Sct., 39, 298-301 (1953). 

9 R. Bellman, T. FE. Harris, and H. N. Shapiro, Some Functional Equations Occurring in the 
Theory of Decision Processes, Rand Papers, No. 382. 

 R. Bellman and S. Lehman, “On the Continuous Gold-Mining Equation,’ 
Acab. Sct. (to appear). 

1! R, Bellman and O. Gross, ‘Some Combinatorial Problems Arising in the Theory of Multi- 
Stage Production Processes” (to appear ). 

12,4 similar formalism has been applied by R. Isaacs in his theory of differential games, i.e., 
systems of differential equations of the form dx;/dt = Min Max F(z, z, w). 

w 2 


Proc. NATL. 


13 R. Courant and D. Hilbert, Methoden der mathematischen Physik, New York: Interscience 
Publishers, 1943. 


A CHARACTERIZATION OF TAME CURVES IN THREE-SPACE 
By O. G. Harrop, Jr., H. C. Grirrira,* E. E. Posty 


DEPARTMENT OF MATHEMATICS, UNIVERSITY OF TENNESSEE 


~ 


Communicated by G. T. W. Whyburn, March 4, 1954 


1. Introduction.—A subset K of 3-space has been called tamely imbedded if there 
is a homeomorphism on space throwing K onto a polyhedron. If XK is a simple are 
or simple closed curve (i.e., a 1-manifold) that is contained in a plane, it follows 
quickly from classical results of Schoenflies that K is tamely imbedded. If the 1- 
manifold is not a subset of a plane, it is necessary to distinguish whether AK is an 
are or a closed curve. If K is a polygonal are, it is already tame and also has the 
property of being equivalent to an interval under a semilinear map on 3-space. If 
K is a polygonal closed curve, it is already tame, but the intermediate step of map- 
ping K into a plane is neither advantageous nor possible if K is knotted. 

About thirty years ago an example was given of an are K which is not equivalent 
to a subset of a plane under any homeomorphism on 3-space. This example and 
others closely related to it, including some simple closed curves, were studied by 
Antoine,! Alexander,” and later by Fox and Artin* and serve to emphasize the path- 
ological difficulties that may occur. The nature of these examples suggests that the 
difficulties are of a local nature as regards the manner the |-manifold is imbedded 
in 3-space. This point of view is borne out by recent results of R. H. Bing‘ and E. 
E. Moise,’ who prove that a locally tamely imbedded set is tamely imbedded. 

The purpose of the present paper is to characterize, by means of positional in- 
variants, those 1-manifolds which are locally tamely imbedded. By the above- 
mentioned result, this implies a characterization of the tamely imbedded 1-mani- 
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folds. If we restrict attention solely to simple closed curves, the purpose may be 
put thus: to distinguish the curves which determine classical knot-types from those 
that are pathological. 

2. Definitions.—The symbols and vu have their usual set-theoretic meaning, 
A \ B means the collection of points in A but not in B, and S(A, e) means the col- 
lection of points whose distances to some point of A is less than e. If B is a homo- 
geneous polyhedron, |0B| will mean the point set carrying the chain 0B. Three- 
space is denoted by R°. 

The set XY ¢ R? is locally polyhedral at p ¢ R*, provided that there is some neigh- 
borhood N of p in R* whose intersection with X is null or a finite polyhedron. The 
set X is called locally polyhedral modulo Y (mod Y) if it is locally polyhedral at each 
point of R* \ Y. 

A homeomorphism h of R* on R? is called semilinear if and only if there are sub- 
divisions o and + of 2* into simplices such that every simplex of ¢ is mapped affinely 
on some simplex of 7. 

Let J be an are or simple closed curve. Then J is said to have property ® provided 
that for each « > 0 and point x eJ there is a topological 2-sphere K ¢ S(x, €) which 
is locally polyhedral at points of K \ J, contains z in its interior, and meets J in a set 
whose cardinal is the (Menger-Urysohn) order of x in J.® 

An are (simple closed curve) J is said to have property Q, provided that there is 
a topological closed 2-cell G (a disk) with J ¢ [0G] (J = |dG|) which is locally 
polyhedral modulo 

An arc (simple closed curve) J is said to have property Q at x, provided that there 
is a closed 2-cell G such that (i) @ is locally polyhedral mod J; (ii) G 9 J is an are 
(in | OG) ); and (iii) G9 J is the closure of a neighborhood of x relative to /. 

Since knotted polygonal curves exist, J may have property Q at every point and 
still fail to have property Q. Examples showing that neither property ® nor prop- 
erty Q alone is sufficient to guarantee tame imbedding are easily constructed. Fox 
and Artin’s example 1.4* is an are having property @® but is wildly imbedded. 
Joining the end-points of this are together by a segment meeting it only at its end- 
points provides a wild curve with property ®. Example 2.1 of the same paper is a 
curve bounding a 2-cell, which may be taken locally polyhedral save at two points 
of its boundary and which has a boundary that is wildly imbedded. 

3. The Characterization.—It is first shown that the are problem depends on the 
closed-curve problem. That is, if an are J has the properties ® and Q, there is a 
simple closed curve K > J that has properties ® and Q. The following theorems 
are then proved. 

I. Jf there is a homeomorphism of R* onto R* carrying J into a subset of a plane, 
then J has properties @ and Q. 

It is not obvious that the properties ® and Q defined above are positional invari- 
ants. The arguments given to prove Theorem I establish this fact. 

Il. If J tsa simple closed curve in R* having properties ° and Q, then J is tame. 

The method of proof of this theorem may be briefly indicated. Let 3 denote 
the unit circle in the xy-plane, @ the disk in the zry-plane bounded by 3, and J; 
the torus bounding the region S(3, 1/7 + 1),7 = 1, 2,.... Let }a;} be a count- 
able dense subset of & and let D, be the disk which is the intersection of the closure 
of S(a;, 1/1 + 7) and the plane determined by the z-axis and a;, 7 = 1, 2,.... 
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Then, since J has properties @ and Q, sets G, 7;, and D; may be constructed so that 
thesetsGu u T;u D,;and uv T,u u D; are homeomorphic. 

The extension of this homeomorphism to a self-homeomorphism of 2* which maps 
J onto 3 is then made. In obtaining this extension strong use is made of the result 
due to Alexander’? that one of the complementary domains of a polyhedral torus in 
spherical 3-space is homeomorphic to a complementary domain of a standard torus. 

It is known’ that if S;, and S., are two pairs of polyhedral spheres in 
with S; contained in the interior of S;’ (Int S;’), then there is a homeomorphism of 
the closure of the set (Int S,’) % (Ext S,) onto the closure of (Int S.’) 9% (Ext So). 
It is evident that quite a diverse collection of types of regions are bounded by pairs 
of disjoint polyhedral tori, but an extension of the above-mentioned result to suit- 
ably restricted pairs of “concentric” polyhedral tori is made. This provides a 
chief tool for the proof of the above theorem. 

III. Jf the l-manifold J has properties ® and Q at each point, then J is locally tame 
and conversely. 


* This work was begun jointly during the tenure of a National Science Foundation Fellowship 
by Griffith. 

1 L. Antoine, “Sur la possibilité d’entendre l’homéomorphisme de deux figures a leur voisinage,” 
Compt. rend. Acad. Sci., 171, 661-663 (1920). 

2 J. W. Alexander, “An Example of a Simply Connected Surface Bounding a Region Which Is 
Not Simply Connected,” Proc. Nari. AcAb. Sct., 10, 8-10 (1924). 

3. Fox and E. Artin, “Some Wild Cells and Spheres in 3-Dimensional Space,’’ Ann. Math., 
49, 979-990 (1948). 

‘R. H. Bing, ‘Locally Tame Sets Are Tame,” Ann. Math., 59, 145-148 (1954). 

5. Moise, “‘Affine Structures in 3-Manifolds. VIIT,’’ Ann. Math., 59, 159-170 (1954). 

6 QO. G. Harrold, Jr., “The Enclosing of Simple Ares by Simple Closed Surfaces in Three-Space,”’ 
Bull. Amer. Math. Soc., 59, 369 (1953). To appear in the Duke Mathematical Journal. 

7J. W. Alexander, ‘On the Sub-division of Space by a Polyhedron,” Proc. Nati. Acap. Set., 
10, 6-8 (1924). 

W. Graeub, semi-lineare Abbildungen,”’ Sitzungsb. Heidelberger Akad. Wissensch., Part 
N, pp. 205-272 (1950). 


CONFORMAL GEOMETRY AND ELEMENTARY PARTICLES 


By R. INGRAHAM 


INSTITUTE FOR ADVANCED STUDY, PRINCETON, N.J. 


Communicated by Oswald Veblen, February 25, 1954 


If the conformal replaced the Lorentz geometry as the geometric base of physical 
field theories, what new physical ideas could be expected? On the purely kinemati- 
cal level, that is, before even the assumption of a particular set of (classical or quan- 
tized) field laws, much can be said. This note summarizes the main finding of a 
paper to appear shortly. The space in question is the 5-dimensional one of all 
spheres in R, (flat 4-space of sig. [++-+—]); the fundamental invariant, the angle 
9, under which two spheres intersect. This is an essential difference from former 
attempts to apply the conformal geometry to physics, which used only the 4-dimen- 
sional space of points (null spheres) as the domain of the physical fields. Our re- 
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sults may be summed up briefly by saying that a new concept of elementary par- 
ticle results. From being an entity superimposed on the basic geometry, as is the 
present view, it becomes integrated into the basic geometry, from which it is logically 
inseparable. Any geometry built on this geometry must have the following kine- 
matical properties. 


1. The spheres are labeled by the physical co-ordinates (r, ¢, #) (Dim L, 7, > 
resp.) when considered as the spheres 


— h 2 


c 
in space-time (‘‘g-space’’), (g, 7) Lorentz co-ordinates, and by (p, E, m) (Dim 
MLT~, ML?T~*, M resp.) when considered as the spheres 


— p)? — 1/c°(y — £)? = e(me)? (2) 
in momentum-energy space (‘‘p-space’’), (a, 7) Lorentz co-ordinates. In these 
(‘natural’) co-ordinates the angle metric has the forms, resp., 


de? = «¢ (““) (dr? — c? dt? — ch? /c? 
(3) 

de? = ¢(mc)~* (dp? — 1/c? dE? — ec? dm’). 
The “sphere” (hyperboloid as real locus) (1) represents a finite (i.e., nonlocalized) 
test particle of size “at” the event t) in g-space. Similarly, (2) represents 


a finite test particle of rest mass m “at” the momentum-energy (p, 2) in p-space. 
The nonlocalizations in qg- and p-space are supposed to vary directly with hw/c 
and me, resp.! Fields will thus be defined on these particle states (thing labeled by 


[r, t, w| or [p, EZ, m]) rather than on the events or 4-momenta alone. 

2. The group of conformal transformations in Ry (proper Lorentz transforma- 
tions, translations, constant dilations, inversions in spheres) is just the 15-param- 
eter group of motions (metric-preserving transformations) of the Riemannian 5- 
space (of constant unit curvature) with metric (3). These motions connect the 
physically equivalent frames of the theory. A special motion 


= inverse z* = —Ap*/[p}?, (a = 1,..., 5) 
(4) 


[x]? (z*)* (x*)? —e€ (x*)?, 


where x! = 2, etc., 24 = ct, = hw/c; p' = etc., = E/c, = cm, exists, 
which takes one from q-space (dim x“ = L) to the associated p-space (dim p* = 
MLT~') and vice versa. Hence any conformal theory exhibits an automatic 
reciprocity? between q- and p-space having the two aspects: (a) any equation in 
q-space gives another true equation in p-space on making the substitution «* > p* 
in it, and conversely; (b) any solution for the fields relative to a g-frame tensorially 
transformed under (4) gives the same solution relative to the associated p-frame, 


MATHEMATICS: R. INGRAHAM 239 


Vou. 40, 1954 


and conversely. Moreover, the conjugate co-ordinates p* and 2“ of a particle 
state satisfy by (4) the uncertainty-relation type equation, 


[px] = per — Et — ehwm = — hh, (5) 


so that the nonlocalizations of the test particle in p- and q-space are in a reciprocal 
relation, Planck’s #4 being a measure of the intrinsic correlation. The subgroup of 
motions involving relative motion of observers represents uniform relative acceler- 
ation of Lorentz frames as found by Haantjes* (Lorentz group «——~ zero accelera- 
tions). Test particle size and mass are invariant under the Lorentz group but nonin- 
variant under the accelerative motions, being renormalized nonuniformly (depending 
on «* or p*). The group of motions can be built up by adjoining to the subgroup 
preserving the physical dimension the single transformation (4) which switches 
and p-space. 

3. The motion of test particles is described by geodesics in sphere space—i.e., 
certain 1-dimensional families of particle states. In our case (force-free) they can 
be written 


r—r =v(t—t), hw/e = |1 — c(t — b), 
6 
Cc 


the null geodesics; and 


> 


=> v(t lo), hw/e = | x 
9 \o do 2 "/2 
1 — 
=U me = 1 u2/c?| /2 x ( ) 


the non-null geodesics. v, ro, fo, Ao, U, Po, Eo, lo are (dimensional) integration con- 
stants, and the condition Sgn(1 — v?/c?) = Sgn(1 — wu?/c?) = —e is imposed. 
(e = +1 seems to be excluded, as giving spacelike motion here and a spacelike rela- 
tion between the spheres [1] and [2] and their centers.) These are free elementary 
particle paths. There are good reasons for thinking that the null geodesics repre- 
sent photon and electron propagation. v = c or u = ¢ (conformal covariant state- 
ment) defines the photon; it follows that w = m = 0. The increase of liw/e with 
time and me with energy shows that the free particle tends to dislocalize in g- and 
p-space with increasing time, energy resp. Born conjugating? the q-paths show 


that p, E, and m are (asymptotically) constant in time and related to one another 


with « = v as expected from special relativity dynamics. This result here follows 
from these motion equations. Free elementary particles cannot be accelerated by 
the uniform accelerative changes of frame, since the uniform-velocity form of (6) 


= 
9711/2 
> 
} 
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and (7) holds for all our equivalent frames. ‘vhis contradicts relativity dynamics 
and suggests a Michelson-Morley type experiment capable in principle of choosing 
between the conformal and Lorentz geometries for physics. 


1 The exact observational significance of these hyperboloids is not yet clear. Concerning this 
model of nonlocalized particles, see H. Freistadt, Phys. Rev., 92, 1015 (1953). 

2 This reciprocity may be called Born reciprocity, and the special motion (4), Born conjugation, 
after the principal exponent of this sort of idea in modern physics. 

3 J. Haantjes, Proc. Kon, Ned. Akad. Wet., 43, 1288 (1940); ef. also Io. Hill, Phys. Rev., 73, 143 
(1947). 


APPROXIMATION BY BOUNDARY VALUES OF ANALYTIC FUNCTIONS 


By JACQUELINE PENEZ 


UNIVERSITY OF MINNESOTA 


Communicated by Marston Morse, February 17, 1954 


F. Riesz! has investigated the problem of minimizing the integral 


|f(2) — p(2)| | dz 


where f(z) is a prescribed polynomial in 1/z and p(z) ranges over a class of bound- 
ary functions of functions analytic for |z| <1. Extremal problems of a similar 
type have been considered by Kakeya,? Golusin,®? Doob,* Garabedian and Schiffer,’ 
MacIntyre and Rogosinski.* In this paper we will discuss the integral 


Sc\f(@) — plz)|*| 


fork > 1, where C is the rectifiable boundary of a finitely multiply connected domain 
D; f(z) is a complex-valued function whose kth power is Lebesgue integrable on C; 
and p(z) ranges over the boundary functions of functions analytic in D. 

More precisely, we let D be a finite region of the z-plane bounded by a system 
of curves C, consisting of an outer rectifiable Jordan curve Cy and v rectifiable Jor- 
dan curves (), ..., C, lying within Cy and having no points in common with it or 
with each other. We let z(s) be the parametric representation of C in terms of are 
length along C. By L*(C), where k > 1, we denote the class of all complex-valued 
functions f(z) defined on C such that 


= Sel fe)|* ds 


exists finite in the Lebesgue sense. Since ||f — g||, = 0 if and only if f = g almost 
everywhere on C, we replace the original space L‘(C) by the space of equivalence 
classes determined by the relation ||f — g||, = 0 and henceforth refer to the latter 
class as L*(C). The class B*(C) will denote the subclass of functions in L*(C) 
which satisfy the following conditions: 


Se f(z)2" dz = Oforn = 0, 1,2,..., (1) 
Se fez — dz = Oforn = 1,2,....;¢= 1,...,9, (2) 


’ 


where a; ¢ Int (C;). Another subclass of L*(C) to be considered is 


= 
| 


VS 
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AX(C) = Cl f analytic in D}, 
where the closure is taken in the space L‘(C). We note that 
A‘K(C) © BC) fork > 1. 


Definition of Smirnoff Domains.—Case (i): Let D be the interior of a rectifiable 
Jordan curve. If z = g(t) maps || < 1 conformally onto D, then, by a theorem of 
F. and M. Riesz, log | o’(t)| e H, (| ¢| < 1), and consequently log ¢’(t)| has a 
Poisson-Stieltjes integral representation. If, moreover, log | g’(t)| has a Poisson- 
Lebesgue integral representation, then ) will be said to be of Smirnoff type.” * 

Case (ii): Let D be the exterior of a rectifiable Jordan curve C and a « Int (C), 
lzilz — a <r} ¢ Int (C). In this case, D will be said to be of Smirnoff type if 
D’ is of Smirnoff type, where D’ is the image of D under z = ¥(w) = (r/w) + a. 

Case (iii): Let D be a finitely multiply connected domain with a boundary C as 
described above. Then D is of Smirnoff type if Int(Co) and Ext(C;) fori = 1,..., 
v are domains of Smirnoff type. We let S denote the class of domains of Smirnoff 
type. 

To f(z) « B*(C) corresponds a function F(z) analytic in D, whose nontangential 
boundary values coincide with f(z) (almost everywhere) on C. Hence B*(C) is a 
class of boundary functions of functions analytic in D. In particular, if D is simply 
connected and D ¢« S, then B[H,(D)] = B*(C) for k > 1 where B[H,(D)] denotes 
the class of boundary functions of functions in H,(D)), the usual Hardy classes.* 1 1! 

TueoreM |. If D ¢S, f(z) B*(C) and g(z) where 1/k + 1/q = 1, then 


Sc S(2)g(2) dz = 0. 


Corotuary 1. If D S, then A*(C) = B*(C) fork > 1. This follows from 
Theorem | by the Hahn-Banach theorem. 

CoroLuary 2. If D S and D is simply connected, then B[H,(D)] = BY(C) = 
A*(C) fork > 1. 

Now we proceed to the extremal problem itself. Both classes B*(C) and A*(C) 
are closed linear subspaces of L*(C), and the spaces L‘(C’) for k > 1 are uniformly 
convex Banach spaces.!?. From these facts, we deduce Theorem 2. 

THEOREM 2 (Existence and Uniqueness Theorems). 

Let f(z) « L*(C), where k > 1. Then there exists a unique function g(z) « B*(C) 
such that 


alle = It Ale 


Similarly, there exists a unique function g(z) « A*(C) such that 


We shall call the function g(z) which minimizes ||f — hj|, over B‘(C) [or A*(C)] the 
minimizing function, and the difference f(z) — g(z) a minimal function. 

TuroreM 3 (Characterization of the Minimal Functions). 

Let 0 # f(z) « L'(C), wherek > 1. Then f(z) is minimal over A*(C) [or B*(C)] if 
and only if | f| */z’(s)f coincides (almost everywhere) on C with a function F(z) « BY(C) 
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for A“(C)], where 1/k + 1/q = 1. We note that minimal functions can vanish at 
most on a set of measure zero, since a function in B“(C) where q > 1 vanishes at 
most on a set of measure zero. 

This extremal problem is related to another extremal problem, namely, that of 
finding the norms of linear functionals on the classes B*(C) and A*(C). This maxi- 
mum-minimum duality in extremal problems has been studied by MacIntyre and 
Rogosinski® and Shapiro,'* and our results overlap with theirs for the case of the unit 
circle. In this connection, we have the following theorem: 

TueoreM 4. For each w(z) ¢ L“(C)(q > 1), where 1/k + 1/q = 1, the function 
L defined for f(z) « B*(C) [or A*(C)] by 

Lf) = So dz 
isa linear functional on B*(C) [or A*(C) ] and 
= max. | L(f)| = min. — glo 
WFllkS1 ge Ag 
fe BE(C) 
or || 42 = max. | L(f)| = min. — glia. 
lfiles1 g« Ba 


fe Ak 


In fact, 


|| Li] |4 PF) — golle, 


where go(z) is the minimizing function over A“(C) [or B(C)] of w(z) and F(z) = 
|w(z) — go(z)|%/2’(s) [w(z) — go(z)]. Conversely, if L is a linear functional on 
B*(C) [or A*(C)], then there exists w(z) ¢ L“(C) such that L (f) = SO f(z)w(z) dz 
and || L|| ge (or || L|| 4x) is as above. 

We remark that a linear functional on B*(C) [or A*(C)] is represented in terms of 
a minimal function over A%(C) [or B4“(C) ], since 


Sc dz = Se f(2) [w(z) — golz)] dz, 


when f(z) B¥(C) [or A*(C)] and go (z)e A“(C) [or B“(C) ]. 

Let us now discuss briefly an application for the case where D is a simply con- 
nected domain of Smirnoff type. We remark that, from the work of J. L. Doob,* it 
is easy to deduce the existence, uniqueness, and characterization for the case k = 1. 
The problem of finding minimizing functions for given functions in L*(C) (k > 1) 
reduces to that of solving the problem for the unit circle by the following theorem: 

THrorEM 5. Let D be a simply connected region bounded by a simple recti- 
fiable Jordan curve C and z = g(¢) map |¢| < 1 conformally onto Int(C) and ¢ = 
#(z) = g-'(z). Let ybe |t| = 1, De S, and f(z) L(C),k > 1. 

(i) If f(z) is minimal over B*(C) (k > 1), then f[g(d] [e’(H]!’* is minimal over 
B*(y) and conversely. 

(ii) If G(t) is the minimizing function for f[¢(t)] [¢’()]!/* over B*(y), then 
G[{@(z)] [’(z)]!/* is the minimizing function for f(z) over B*(C). 

A pplication.—Let D be as in Theorem 5 and a e D. The minimal function for 
1/(z — a) over B*(C) fork > 1 is 


C 
| 
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1 — | ®a)|? 
P(z) — P(a) 1 — ®(a) (2) 


If we choose &(z) such that ®(a@) = 0, then the minimal function for 1/(z — @) re- 
duces to (b’( ax) ‘@(z), which we will denote by h(a, z) for simplicity. 
We note that theoretically the problem of finding the mapping function is equiva- 
lent to that of finding the minimal function for 1/(z2 — a). Since h(a, 2) is mini- 
mal, then | h(a, z)| */2z'(s)h(a@, z) coincides (almost everywhere) on C with a function 
p(z) « BY(C). As usual, we define p(z) for z « Int(C) by Cauchy’s integral and 
pz) 
The function L, defined for f(z) « B“(C) (¢ > 1) by 


LN = ae, 


isa linear functional and Theorem 4 implies that 


1/q 


and equality is obtained when and only when f(z) = p(z). 

One needs only to vary the kernel w(z) in Theorem 4 to obtain other interesting 


results. 
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A NOTE ON THE ABSTRACT CAUCHY PROBLEM* 
By R. 8S. 


YALE UNIVERSITY AND THE UNIVERSITY OF SOUTHERN CALIFORNIA 


Communicated by E. Hille, February 6, 1954 


1. The notion of an abstract Cauchy problem has recently been introduced by 
E. Hille.!’ In the present note we show that if the Cauchy problem is solvable in 
a restricted sense with respect to a given closed linear operator U’ for all initial 
values in the domain of U, then U is the infinitesimal generator of a semigroup 
yielding the same solutions to the Cauchy problem. Under our assumptions, the 
existence of a unique solution to the Cauchy problem for all initial values in the 
domain of U’ is in itself sufficient to force the solutions to vary continuously with 
the initial data. 

2. We begin with a summary of semigroup theory.? A family of linear bounded 
operators [S(t);¢ > 0] on a complex Banach space & to itself is called a semigroup 
if + &) = S(t)S(b) for > 0. will be assumed to be continuous in the 
strong-operator topology for 0. In this case w) = lim,... (~! log ||S(0)|| < 
The semigroup is said to be of class (Co) if lim,.o94 S(Q)x = x for each x e X; it then 
follows that, given w > wp, there exists an VM > 0 such that S(d)|| < M exp (wt), 
t > 0. A semigroup is said to be of class (0, A) if So’ | S(t).r|| dt < o for each 
x €X and if the linear bounded operator, 

R()x = So” exp dt, (2.1) 


defined for R(A) > wo is such that lim,.... AR(A)x = x for each x ¢X. For semi- 
groups of both classes lim,.94 ¢~'[S() — /]a = Av exits for a set D(A) dense in X. 
A is called the infinitesimal generator of [S(t)]. A is closed for semigroups of class 
(Cy). For semigroups of class (0, A), A has a smallest closed extension A, called 
the complete infinitesimal generator (c.i.g). In either case the resolvent R(A; A) 
exists and R(A) = A) for RA) > Finally, if D(A), then lim,.94 = 
x and 


d[S(a]/dt = AS(t)hx = S(t)Ax (2.2) 


fort >0. If [S(é)] is of class (Co), then (2.2) holds for all ¢ = 0 with S(O) = J, by 
definition. 

3. The two formulations of the abstract Cauchy problem given below are sug- 
gested by properties of semigroups of class (Cy) and (0, A), respectively. Both of 
these formulations differ somewhat from the Cauchy problem origina!ly considered 
by E. Hille. 

ACP, (ACP2).—Given a linear operator U with domain and range in ¥ and given 
an element yo €X, find a function y(t) = y(t; yo) such that 

(i) y(t) is strongly absolutely continuous and continuously differentiable in each 


finite subinterval of {0, ©) [or (0, 


(ii) y(t) and U[y(t)] = for each t > 0; 

(ili) Limy.o4 Yo) = Yo. 

The two problems differ only in the following respect: In ACP), y’(t; yo) is con- 
tinuous for ¢ = 0, whereas in ACP», y’(t; yo) is continuous only for t > 0. In both 
cases Jo! ||y’(t; yo)|| dt< 
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We can now give a precise statement of our main results. 

THEOREM 3.1. Let U be a closed linear operator with dense domain and nonvacuous 
resolvent set. Suppose that for each yo « D(U’) there is a unique solution to ACP,. 
Then U generates a semigroup [S(t)] of class (Co) such that S(t)yo = y(t yo) for all 
ye D(U). 

THEOREM 3.2. Let U be a closed linear operator with dense domain and such that 
U)|| = as > ©. Suppose that for cach yy D(U) there is a unique 
solution to ACP. Then U generates a semigroup [S(t)| of class (0, A) such that 
S(Qyo = y(t; yo) for all yo « DCU). 

Perhaps the most critical assumption in the above theorems is that the solution 
of the ACP be unique for each yo « D(U). That this assumption is not unduly re- 
strictive follows from 

TuroreM 3.3. Jf A is the c.i.g. of a semigroup of class (0, A), then ACP» has a 
unique solution for each Yo € D(A). 

Proof: The existence of a solution to the ACP» for each yo ¢ D(A) follows from 
the relation (2.2). If there were two solutions for a given yo ¢€ D(A), then the dif- 
ference z(t) would be a null solution of ACP, that is limy,o+ 2(4) = 6. It suffices 
to show that z(¢) = @ for allt > 0. Now 


[S(t — = S(t — (—A + A)z(o) =6, 
ao 


Consequently, S(é — 8)2(8) = S(t — a)z(a) forO << a<B<t. Passing to the 
limit first as a > 0 + and then as t > 6+, we obtain 2(8) = S(8)z(0) = @ for all 
0. 

Theorems 3.1 and 3.2 are very closely related, and it is possible to establish cer- 
tain lemmas which are applicable to the proofs of both. 

Lemma 3.1. Let U be a closed linear operator and suppose for each yo « D(U) that 
there exists a unique solution to ACP2. Then for each t > 0 there exists a constant C, 
such that 


yo)|| + || U yo) + || (3.1) 


Proof:? We first introduce a new norm in D(U), namely, = |||! + || 
and denote the so topologized vector space by D(U)’. The operator U being closed, 
it follows that D(U)’ forms a Banach space. We also introduce a second auxiliary 
space &’ consisting of all strongly continuous functions x(t) defined on (0, ©) to 
D(U)’ such that || a(o)||’ do < ~. We define a metric in by 

= — 


d[x(-), y(-)] = 


where 
|x(-)|lo = So! do, 


It is easy to show that ©’ is a complete metric space. We now consider the trans- 
formation 7’ on D(U)’ to C’ which takes yo « D(U)’ into the corresponding solution 
of the ACP», that is, y(t; yo). Because of the uniqueness assumption, it follows 
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that 7’ is linear on D(U’)’.. We shall now show that 7’ is closed. To this end sup- 
pose that y, > yo in D(U)’ and that T(y,) > v(-) in ©’. In particular, y’(¢; y,) = 
U(y(ts yn) | > U[v(d)] in the ¥ norm uniformly in each closed subinterval of (0, ©) 
and Jo! ||y'(o; yn) — Ulv(e)]|| do > 0. As a consequence, the relation y(¢; y,) = 
Yn + So y'(o; Yn) do tends in the limit to v(t) = yo + So’ U(v(o)| do. It follows 
that v(t) is a solution of the ACP, with initial value yo, and the uniqueness now im- 
plies that v(t) = y(t; yo). This proves that T is closed. Hence by the closed-graph 
theorem! we see that 7' is continuous. Continuity at y) = @ implies the existence 
of constants C,,n 2 1, such that ‘ly(- : yo)| n = C,, |lyol|’ for || yol| ’ sufficiently small. 
However, since both sides of this inequality are homogeneous in yo, the result is 
valid for all yo « D(U)’; this establishes (3.1). 

LemMa 3.2. Let U be a closed linear operator and suppose for each yo ¢ D(U’) that 
there exists a unique solution to ACP». If yo D(U*), then 


y(t; = = yo), (20. (3.2) 


Proof: It yo « D(U*), then y(4; Uyo) is strongly absolutely continuous and 
Uly(t; Uyo)| = y'( Cyo) is Bochner integrable (relative to the ¥ norm) in each 
interval (0, 8). Consequently, 


yt; Uyo) = Cyo + y'(a; do = Uy + yo; Uyo) do]. 


Setting 2(4) = yo + So’ y(o; Uyo) do, we see that 2(¢) is continuously differentiable 
for ¢ = O and that 2’(t) = y(t; Uyo) = Ulz(t)]. Thus 2(é) is a solution of the ACP, 
with initial value yo, and the uniqueness im; lies that z(f) = y(¢; yo). The relation 
(3.2) now follows. 

We come now to the proof of Theorems 3.1 and 3.2. We define the operator 
S(t)yo = y(t; yo) on D(U)’ to itself for > 0. The uniqueness of the solution to the 
ACP, implies that S(é) is linear, and Lemma 3.1 implies that S(¢) is bounded. Con- 
ditions (i) and (ii) of ACP. imply that S(¢) is continuous in the strong-operator 
topology of D(U)’ for ¢ > 0. Further, uniqueness implies that y(4; + f2; yo) = 
ylh; y(te; yo) | for each yo D(U) and > 0. In terms of the operator S(é), this 
means that S(t; + f2) = S(t)S(t:), so that S(¢) defines a semigroup of linear bounded 
operators on D(U’)’, strongly continuous for ¢ > 0. Choose do € p(U’). Then for 
any x « D(U) there exists a y « D(U?) such that « = Ay — Uy. By Lemma 3.2, 
S(t)Uy = US(t)y, and hence 


S()x = wWS(Oy — SIOUy = wWS(Oy — 


Setting 47; = max. (1, dol ), we have 

+ Cy). 

Now y = R(Ao; U)a, so that < || U)|| and = — 
Mz |\x||, where Ms = M,|!R(\o; U)|| + 1. Combining this with the above relation, 
we finally obtain 


<M, 


= |} 


D 


for all x e D(U). Since D(U) is assumed dense in X, the operator S(é) can now be 
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uniquely extended to be linear and bounded on X itself. The relation (3.3) re- 
mains valid for the extended S(é), and it is easy to show that S(/) becomes a semi- 
group of linear bounded operators on X, strongly continuous for ¢ > 0. By defini- 
tion, S(Qyo = y(t; yo) for each yo e D(U) and t > 0. 

Proof of Theorem 3.1.—Condition (i) of ACP, implies that lim,.9+ y(t; yo) and 
lim,.o+ U[y(t; yo) | exist in the ¥ norm for each D(U). Therefore, lim,.9+ 
exists in D(U’)’ for each yo ¢« DCU)’, and, by the uniform boundedness theorem, 
'S(t)|| remains bounded near ¢ = 0. According to (3.3), S(é) is likewise bounded 
near = 0. Condition (iii) of ACP, asserts that lim,—.9+ | S(t)a — x|| = 0 for each 
xe D(U), and, applying the Banach-Steinhaus theorem, we see that the same is 
true for all x « X. Consequently, S(¢) is of class (Cy). It remains to show that 
A=U. If ye D(U), then by condition (i) of ACP; we see that S(O) yo = y(t yo) is 
in particular differentiable at / = 0; thus yo ¢€ D(A). Also y(t; yo) > yo and U[y(t; 
yo) | = y(t; Yo) > Ayo, and this, together with the fact that U’ is closed, implies 
that Uyo = Ayo. Uyo = Ayo; in otherwords U ¢ A. Consequently if yo ¢D(U) 
and R(A) > we, then exp (— At) US(t)yo = exp (— At) S(t) Ayo is Bochner integrable 
on [0, ©) and hence UR(A; A)yo = Jo” exp(— dt)S(t)Ayodt exists. Thus UR 
A)y = A)y = ARQ; A)y — y for all y D(U), and, since D(U) is dense 
in ¥ and U is closed, we see that this relation holds for all y « ¥. Since R(\; A) 
[¥] = D(A), it follows that D(U) > DCA) and hence that A = U. This concludes 
the proof of Theorem 3.1. 

Proof of Theorem 3.2.—In the statement of Theorem 3.2 we have assumed 
that ||R(A; U)|| = O(1/A) ask > ©. Thus, for xe D(U), — = 
U)Uz|| 0 as and, by the Banach-Steinhaus theorem, lim, 
U)x =2 forallz eX. Suppose that ye D(U). Then y, =nR(n; U)y « D(U?) 
forn sufficiently large, y, > y, and Uy, = nR(n; U)\Uy—> Uy. According to Lemma 
3.2, US(t)y, = S(t)Uy,; and, passing to the limit asn > ©, we see that US()y = 
S(t)Uy; here we have made use of the fact that U is closed. Finally, if yo « D(U), 
then S(t)Uy = US(Q)yo = y’(t; yo) is Bochner integrable near ¢ = 0 by condition 
(i) of ACP. Now for Xo « p(U) and x ¢ X there exists a y « D(U) such that x = 
hy — Uy. It follows that S()x2 = A» S(t)y — S(t)Uy is Bochner integrable near 

= (0. We now define R(A) as in (2.1). It can be shown that R(A) defines a linear 
bounded operator for each A, R(A) > wo. For yo « D(U) we have, as above, 


RAUYy = So” exp yo) dt, 


and an integration by parts gives R(A)Uyo = —yo + AR(A)yo. Thus R(A) (AT — 
U)y = y for all y e D(U). If d is sufficiently large, then A € p(U) and (AJ — U) 
maps D(U’) onto X, from which it follows that R(A) = R(A; U). Thus \R(A)zr = 
AR(A; U)a > x as\—> © for each x ¢ X, and hence [S(t)] is of class (0, A). Finally, 
as we have already noted, R(A) = R(A; A) for \ > wo, where A is the c.i.g. This 
implies that A = U, which concludes the proof. 


* This paper was written under the sponsorship of the Office of Ordnance Research, U.S. Army, 
under contract DA-04-495-OR D-406. 

+ Present address, Department of Mathematics, Yale University, New Haven, Conn. 

'E. Hille, “A Note on Cauchy’s Problem,” Ann. Soc. polonaise math., 25, 56-68 (1952); “Une 
généralisation du probleme de Cauchy,” Ann. Inst. Fourier, 4, 31-48 (1953). 

* See E. Hille, “Functional Analysis and Semi-groups,’’ Amer. Math. Soc. Coll. Lectures, 
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(1948); and R. S. Phillips, ‘An Inversion Formula for Laplace Transforms and Semi-groups of 
Linear Operators,” 59, 325-356 (1954). 

’ The method of proof employed in Lemma 3.1 was suggested by some work of P. Lax on the 
continuous dependence of the solution of a linear partial differential equation on initial data. 

‘§$. Banach, Théorie des opérations linéaires, Warszawa, 1932, p. 41. 


NEW TYPES OF TRINOMIAL CONGRUENCE CRITERIA APPLYING TO 
FERMAT’S LAST THEOREM 
By H.S. VaANnDIVER 
UNIVERSITY OF TEXAS, AUSTIN, TEXAS 
Communicated Feburary 20, 1954 


In another paper! the writer obtained a result which may be stated as follows: 
Jf lis.an odd prime and 


vty +2 =0 (1) 
and y =0 (mod 1); (xz, l) = 1, the prime ideal » in K (the algebraic field defined by 
¢ is adivisor of the ideal (p); p a rational prime; 

N(p) — 1 


where N(p) is the norm of p, further, p belongs to an exponent? which is prime to | with 
xyz # 0 (mod p), then 


c e 


is divisible by p fors = 1,2,...,l1—1,andy/x =t 

In another article’ we developed these criteria further and obtained criteria inde- 
pendent of ¢ which were expressed in determinant form. These were complicated, 
however, and seemed too difficult to handle. The problem then arose to obtain, if 
possible, the equivalent of some of the material used in the proof of Theorem I ina 
form which seemed simpler. Such a form was found by the writer in the year 
1944 but is published here for the first time. 

We have! (relation [5a]), with the conditions in Theorem I on yp, 


(x + =1 p). (3) 


Now let p = 1 + cl, that is, the ideal p is of the first degree. Now the statement in 
(3) is equivalent to the statement that 


r+ (mod>), (4) 


a=1,2,...,p— 1, with w, some integer in K. We may select a rational integer 
J, 28 a primitive root of the ideal p such that 


(9° P), (5) 


— 


where we set g for g, during an argument where p is fixed (as in the work up to the 


a 
fi 
\ 
2 
| 


Ips of 


mn the 


OWS: 


the 


Vou. 40, 1954 MATHEMATICS: H. S. VANDIVER 


statement of Theorem II which follows). Then this gives, using (5), 
(mod p). 
and consequently, from (4), 


ut g“y = w,' (mod yp). 


irther, since w, = +... + with the b’s rational integers, 
1-2 1-2 
Wy = > be = (mod p), 
1=0 7=0 


and substituting in (4) gives 
vt gy =k,' (mod p) (6) 


fora = 1,2,...,p — 1, where k, is a rational integer and (k,, p) = 1. From (1) 
we also have, using the given condition (z, p) = 1, 


xt+ y= ko'; (ho, P) = (6a) 


where iy is a rational integer. Now g is also a primitive root of the rational prime 
p, and, since by definition zyz # 0 (mod p), we have integers r and s such that 
g’ =xand g = y (mod p) and k, =g"« (mod p): (6) and (6a) then give, for a = 0, 1, 


1 = (mod p). (7) 
Now in another paper‘ the writer considered the congruence for a fixed 7 and J, 
1+ gi!  (modip), (8) 


0<i<e-—1;0 <j <l— 1. We consider the problem of finding the distinct 
sets x, and x2 in (7) with 2, in the set 0, 1,...,2— land 2 in the set 0, 1,...,¢- 
1. We shall represent’ the number of these distinct sets x, x2 by (7, j),,, and use 
(i, j) when throughout an argument c and / are fixed. If e and f are any integers, 
we define (e, f)., as (/e|, |f|)e, where |e| represents the least positive or zero resi- 
due of e modulo c, and | f| equals the best positive or zero residue of f modulo 1. 
Evidently (7,7) = (¢ + ca, j + 18). We know (N, relation [4]), since / is odd, that 

0 if i ¢/2, 

if i = ¢/2. (9) 
Now in the relation (7) of the present paper we have / congruences with values of 
a=0,1,2,...,2— 1 with corresponding values ¢,. In effect, this states that the 
number of solutions 2, 22 in (7) is, for the present particular case, equal to (s — r, 
! — r) and that this number equals /. 

We may then state: 

THeoreM II. Jf (1) 7s satisfied in non-zero integers, | is an odd prime with y = 0 
(mod 1) and (xz, 1) = 1, the prime ideal » in K belongs to an exponent prime to l, p 
divides the ideal (p) where p is a rational prime of the form 1 + cl, xyz # 0 (mod p), 
then there is an i and aj in (8) such that (7, J) .: = Ll. 

In N we refer* to what we called the two extreme cases in connection with the 
(7, 7), first, where a given 7 was such that (7, 7) was unity for each j and the second, 
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when for a given 7 there exists a j such that (7, 7) = 1. The second case applies to 
our Theorem II. It follows from (9) also that when we state that (7, /) = / fora 
particular 7 and 7, then (7, h) = 0 for h # j (mod J), 

In N, Mrs. Pearson and the writer examined the (7, j)’s for the values of | = 
3, 4, 5, 6, 7, 11, and 13, and various primes p of the form 1 + cl. It was found for 
1 > 5 that for none of the 34 primes p tested did the second extreme case hold for 
anyzandj. (The primes p were not selected as being necessarily such that a prime 
ideal divisor p of (p) belongs to an exponent prime to /.) The largest p examined 
was 1123 with/ = 11. 

In N, we considered our trinomial congruence initially in the form 


1 + av’ = bw” (mod p), (10) 


where abry # 0 (mod p), with p an odd prime. We signalized three problems 
involving (10), and we shall now consider the third one treated there; namely, 
“Tfin (10) aand b are fixed integers, abry # 0 (mod p), m is fixed, and p increases to- 
gether with ¢ subject to the relation me = k(p — 1) with ka fixed integer, what can 
be said about the number of the sets of solutions x, y?’’ We shall now consider the 
relation of this to the problem of our present paper. If we take our relation (6) 
and divide through by x, noting that (2, p) = 1, the result may be written in the 


form 
1+ =y'h,' (mod p), (11) 


where hay =k, (mod p). We.also set here & = 1 and m = 1. It will be seen that 
(11) is of the same form as (10) except that y/x in (11) is not expressed explicitly in 
integral form. Owing to this fact, if we do express this y/2 as congruent to an in- 
teger modulo p, then this integer will vary with each p. In view of the fact that 
what theory we have concerning these trinomial congruences nearly all involves 
the use of primitive roots as in (8) of this paper, we shall continue our investigations 
on this basis. In spite of the fact that a primitive root changes with each prime, 
it turns out that we shall be able to treat (11) when x and y are fixed and p varies 
over the various primes. This will be shown elsewhere. 

In connection with our problem concerning (10) as applied to (1), we shall show 
that it is possible to select a class of primes p, for a certain (7, 7), such that the con- 
gruence (10) is satisfied for m = / (an odd prime) sets of values v’, w”. Set in (10), le 
= p — 1, witha = 1 and b = | and select p so that it is the norm of a prime ideal 
p with pa principal ideal (w) such that 


wo = 1+ 


\ = ¢ — 1, 6 an integer in K. Also we can choose our primitive root so that (4) 
holds. With a = 1 and b = 1 in (10) and applying (5), we have, noting them 
that the possible incongruent values of v’ in (10) are congruent to 1, ¢, ¢?,..., ¢°74 
modulo y, 
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fora = 1,2,...,¢— 1, where (3 p) is a power of ¢ such that 


(2) (mod p): [(8), p] = 1, 


with 8 an integer in A. The relation (12) follows from the fact that (1 + ¢*) isa 
unit in A and w is primary.® 
Again, since w is primary, we have, by the law of reciprocity for /th powers,° 


This shows that, if (10) is written in the form (8) by employing primitive roots, after 
we set m = /, the resulting congruence (8) has / solutions 2, 2», that is,’ (0,0) = J. 
By a known theorem, the set of principal ideals (1 + 2A‘@), if we let @ range over the 
integers in A, contains an infinity of prime ideals.* 

In Theorem I we may replace x by z and z by x in its statement. Consequently, 
for p sufficiently large, there are two different quantities (2%, j1) and (?, j2) which 
equal /. Since in the statement of Theorem I we have to replace y/x by y/z, and 
since from (11) we see that for p sufficiently large, 


y/x y/z (mod p), 


in fact, it is only necessary to take p as a prime > 7/(2 — z) to satisfy said condition, 

As noted earlier,! Theorem I refers only to the second case of Fermat’s last theo- 
rem. In the other, or first, case, a criterion has been given* (p. 149, Theorem IV) 
based on methods analogous to those used in our original’ proof of Theorem I. 
Using the methods of the present paper, we may give a companion theorem to our 
present Theorem ITI but which refers to the first case of the Fermat problem. How- 
ever, the details will not be given here. 

We have arrived at various consequences of the criteria mentioned here which we 
hope to be able to publish in further papers. 


'“Summary of Results and Proofs on Fermat’s Last Theorem (Third Paper), these Pro- 
CEEDINGS, 15, 45 (1929). Note that Theorem I refers to the second case of Fermat’s Last Theorem 
as y=0(mod /), zz 0(mod /). 

? We say that an ideal DP belongs to an-exponent n when )" is a principal ideal but “is not a prin- 
cipal ideal for 0 <d <n. 

*“Certain Congruence Criteria Connected with Fermat’s Last Theorem,” these PROCEEDINGS, 
28, 144-150 (1942). 

4“On a New Problem Concerning Trinomial Congruences Involving Rational Integers,” these 
PROCEEDINGS, 39, 1278-1285 (1953). This paper will be referred to as ““N.” 

In his paper “On Cyclotomy and Extensions of Gaussian Type Quadratic Relations Involving 
Numbers of Solutions of Conditional Equations in Finite Fields,” these PRocEEDINGS, 38, 981- 
991 (1952), the writer used a notation (relation [1] of that paper) which for the form (8) of the 
present paper is equivalent to the statement that (7, j)c: is the number of solutions of the congru- 
ence 


4 gitlvz=1 (mod p) 


in the y’s, but in N this number was designated by [i, j]e. A reader who is referring to these several 
papers of the author should be careful to note these different notations. Cf. note 6 of N. 
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°T. Takagi, ‘Ueber das Reciprocitiitsgesetz in einem beliebigen algebriiischen Zahlkérper,” 
J. Coll. Science, Tokyo Imp. Univ., 44, 26 (1922), Satz 11. 

7 For | = 3, this is exemplified by the cases p = 109 and 127. For / = 5 an example is p = 
61051. We have not determined the smallest prime of the latter type. 

8 E. Landau, ‘“‘Ueber die Verteilung der Primideale in den Idealklassen eines algebriiischen Zahl- 
kérpers,” Math. Ann., 63, 196-197 (1907); T. Takagi, ‘Ueber eine Theorie des relativ Abel’schen 
Zahlkorpers,” J. Coll. Science, Tokyo Imp. Univ., 41, 19 (1920); H. Hasse, “Bericht ueber neuere 
Untersuchungen und Probleme aus der Theorie der algebriéischen Zahlkérper,” Zahreshericht 
Deutsch. Math.-Vereinigung, 35, 1-55 (1926). 


DIRECTIONAL SENSITIVITY OF SINGLE OMMATIDIA IN THE 
COMPOUND EYE OF LIMULUS* 


By H. WaTeERMAN 
OSBORN ZOOLOGICAL LABORATORY, YALE UNIVERSITY 


Communicated by J. S. Nicholas, February 9, 1954 


Compound eyes differ from those of other types, not because the mosaic quality 
of their image is imposed by a cellular retina; this is so in all eyes. Rather the 
compound eye is unique in having a dioptric apparatus which is itself also a mosaic, 
one made up of facets and lenses, each focusing light on different specific parts of a 
convex retina. Because of restricted angular apertures and differences in the 
orientation of the axes of these optic elements, single ommatidia do not individually 
survey the whole visual field of the eye. Instead, one such unit can be stimulated 
only by a fractional part of the total field, a small patch lying near its own optical 
axis. An image of this spot is the ommatidium’s contribution to the over-all visual 
mosaic. 

Accordingly, the directional sensitivity of an ommatidium in the compound eye 
will be one of its fundamental physiological properties. Despite such importance, 
however, practically no direct information is available on this point. The present 
report describes some measurements of this function initiated in the lateral eyes of 
Limulus. Their relationship to the mechanism of polarized-light sensitivity will be 
considered subsequently.! 

Materials and Methods.—Lateral eyes of Limulus were prepared for study as 
described previously.2. However, two modifications in the method of mounting the 
eye were required by the present experiments. (1) To avoid the problem of un- 
controlled reflection and refraction when the eye was rotated, it was mounted in air 
and sealed with paraffin wax to a plastic holder in such a way that the optic nerve 
passed through a slot into the physiological saline, under which it was necessary to 
isolate single responding units. (2) The chamber was then placed on a circular 
stage mounted on a dissecting microscope. This permitted the eye’s orientation in 
azimuth to be conveniently and accurately changed while the nerve and the record- 
ing electrodes could be observed closely. In preliminary experiments, an additional 
adjustment in elevation was included, but the necessity of using the saline bath in 
preparing fine bundles of axons prevented its effective use. Consequently, the 
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directional sensitivity of ommatidia in different anatomical planes had to be dis- 
tinguished by mounting eyes in various positions on the holder. 

The location and relative orientation of the ommatidia illuminated by the 
stimulating light were controlled by covering all but certain facets with opaque 
paint, as described elsewhere.* In this way axons from exposed ommatidia in the 
center of the eye, which were most favorable for these experiments, could be 
recognized at once. It will be seen in Figure | that in this region the optical axes 
of the corneal lenses are normal to the eye surface. In the more peripheral areas of 
the eye, as observed long ago by Exner,‘ marked deviations up to 30-40° from 
normality occur. 


Fic. 1.—Camera lucida drawings of sections of the cornea and corneal lenses of a : 
Limulus lateral eye (Juvenile 2, 40 mm, prosoma length). The angle between the t 
optical axes of marginal elements is 111° in the dorsoventral plane, 184° in the antero- 
posterior one. The actual visual field of the eve would be greater than this because of 
the large angular aperture of the ommatidia and the refractive effect of the cornea, 
where the optic axis is not normal to the surface. The angle of separation between the 
axes of neg bleviine facets varies from 4° to 15°. X 25. 


The optical system used to stimulate the eye comprised the following. The 
source was a 6-volt, 18-ampere ribbon-filament microscope lamp, the image of 
whose uniform emitting surface was focused on a depolarizing filter comprising three 
layers of ordinary waxed paper mounted between optical blanks. In addition to the 
focusing lens, between the lamp and the depolarizer were a heat filter, a neutral 
Wratten filter for adjusting stimulus intensity in large steps, and two Polaroid sheets 
used for fine adjustments of transmitted intensity. A shutter was placed between 
the depolarizer and the Limulus eye preparation. Since this filter not only de- 
polarized the light but also scattered it diffusely, it acted as the effective light source 
for stimulating the ommatidia. Consequently, the functional aperture for the 
stimulus was set by the diaphragm of the shutter. The geometry of the system was 
such that this evenly illuminated surface subtended an angle of 8° at the cornea. 
While further quantitative experiments need to be done with a smaller stimulating 
light source, it will be seen below that the directional sensitivity of the ommatidia 
in the Limulus eye is so broad that this factor did not greatly affect the present 
results. 
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The recording technique employed was standard, with tapered silver wire pickup 
electrodes, condenser-coupled amplifiers, and moving-film photography of the 
cathode-ray tube traces. 

Experimental Results.—Three different ways of measuring directional sensitivity 
yielded the same qualitative conclusion, namely, that a single Limulus ommatidium 
is responsive over an astonishingly wide range of incident angles. In response to 
brief flashes of light, discharges could readily be obtained from a single unit in the 
optic nerve with the stimulus striking the cornea at angles as great as 30-40° away 


Fig. 2.—Responses of a single ommatidium to 0.7-second flashes 
of light striking the cornea at angles differing from an arbitrary 0° 
by the amounts indicated. The ommatidial axis, normal to the 
corneal surface, was located at about 45° on this scale. With light 
incident at — 10° and at 90° or beyond, no response was obtained at 
the light intensity used throughout this series of exposures. The 
upper line of the records shows the single-unit optic-nerve impulses 
resulting from the flashes; the lower one indicates the occurrence 
and duration of the flash on the same temporal abscissa. 


from the optical axis (Fig. 2). Within this range, of course, the number of impulses 
resulting from a flash of given intensity and duration decreased as the angle of 
incidence deviated to either side of the optical axis. Similarly, when a single 
ommatidium was stimulated with continuous illumination, a steady discharge was 
obtained over a wide range of incident angles (Fig. 3). Even in such light-adapted 
units, responses occurred with the light entering at 30° on either side of the op- 
tical axis. 

The third way of measuring directional sensitivity, in addition to confirming the 
above results, also provided more quantitative data on the phenomenon. In this 
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case the threshold for a particular electrophysiological response was found as a 
function of the angle of stimulus incidence. The standard response was a series of 
four nerve impulses resulting from 1-second flashes given every 15 seconds. Stimu- 
lus intensity was adjusted at the various angles until this response was just 
obtained. 


Fic. 3.—Responses of a single ommatidium to continuous illumina- 4 
tion of constant intensity but of various incident angles, as indicated j 
relative to an arbitrary 0°. The eye was light-adapted for 30 sec- 
onds at each of the different incident angles before the record was 
made, The optical axis of the unit, normal to the corneal surface, 
was between 30° and 35° on the arbitrary scale. The lower line 4 
in each strip indicates time in 0.1-second intervals. 


Typical data secured in this way with either horizontal or vertical rotation of the 
eye are plotted in Figures 4 and 5. Obviously, the lowest thresholds were found 
close to the optical axes of the ommatidia, although high degrees of sensitivity 
extended out to 10-20° on either side. Responses frequently occurred with 40° 
deviations from the axis. Even larger angles were not unusual, and on one occasion 
the threshold in question at a 90° incident angle was less than 4 log units above the 
lowest threshold measured directly on the optic axis. Note that the effective 
apertures of a single ommatidium are twice the size of these deviations, carrying 
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o° 30° 60° 


SER INCIDENCE 


Fic. 4.—Thresholds of three single ommatidia as a function of the 
angle of stimulus incidence in the dorsoventral plane. The 0° rep- 
resents the direction of the optical axis. The distinction between 
the absolute and angular sensitivities of the three units are the result 
of different orientations of their axes in the anteroposterior plane. In 
the most sensitive case the incident light made the smallest angle with 
the axis of the unit concerned. The threshold utilized to determine 
the light intensities plotted was a response consisting of 4 impulses 
in the optic axon involved. Zero on the light-intensity scale was 1 
meter-candle. 


STIMULUS Qf INCIDENCE 


90 
+2 (7) -/ -2 -2 -/ 7) 

Fic. 5.—Thresholds of three single ommatidia as a function of the angle of 
stimulus incidence in the anteroposterior plane. Plotted on polar co-ordinates. 
Other features of the measurements and graph are the same as described for 
Figure 4. 
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high degrees of sensitivity to 40°, maximum to 180°! This should justify the 
statement made above that the 8° extent of the stimulus, although it could 
obscure fine detail, particularly close to the axis, had little influence on the general 
outeome of the measurements. Observe also that directional sensitivity appears to 
be similar in the two different anatomical planes. 

Discussion.—The most important physiological consequence of the angular 
sensitivity of an ommatidium will be its relation to visual acuity. Nearly all the 
critical parameters of acuity, such as resolving power of the dioptric components,* 
the cellular mosaic of the retinulas concerned,’ nervous integration,” * intensity 
and wave-length of illumination, contrast, and the spatial discrimination under test, 
were adequately controlled in the experiments. Consequently, the main factor to 
be discussed here is the relation of the observed directional sensitivity to the angle 
separating the ommatidial axes. 

Measurements of corneal sections like those shown in Figure 1 indicate that the 
maximum angular separation of ommatidial axes is 15° or less, the minimum 4—5°. 
Clearly, then, the overlap in the visual fields of neighboring units must be extensive, 
since thresholds may not rise more than 2 log units with light incident 15° from the 
optical axis. Visual acuity might, for this reason, be less, and perhaps considerably 
so, than would be predicted from the number and angular separation of the omma- 
tidia. Such a visual disability would fit in well with evidence from several other 
lines of work, suggesting that xiphosuran lateral eyes are degenerate structures.*: ° 
On the other hand, the high acuity of vertebrate retinas, despite marked overlap in 
the receptive fields of single optic-nerve fibers,!° should provide a caveat against 
generalizing from optical or neurological data alone. 

Experiments to determine the visual acuity of Limulus directly are now under way 
in the author’s laboratory, with the optomotor response to moving stripes as a 
measure of visibility. Comparison of such behavioral results with the present 
electrophysiological data should further our understanding of the over-all problem. 

This report may be briefly summarized by saying that measurements of direc- 
tional sensitivity in single Limulus ommatidia show these units to respond over 
a broad range of incident stimulus angles. Although high sensitivity was centered 
within 10-20° of the optical axis, responses to reasonable light intensities were ob- 
tained 80-90° from this referent. 
* These studies were aided by a contract between the Office of Naval Research, Department of 
the Navy, and Yale University, NR 163-091. A preliminary account of this work was presented 
at the December, 1953, meeting of the American Society of Zoédlogists (Anat. Rec., 117, 566, 
1953). 

'T. H. Waterman, Proc. Natu. Acab. Scr., 40, 258-262 (1954). 
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POLARIZED LIGHT AND ANGLE OF STIMULUS INCIDENCE 
IN THE COMPOUND EYE OF LIMULUS* 


By Tatsor H. WATERMAN 
OSBORN ZOOLOGICAL LABORATORY, YALE UNIVERSITY 
Communicated by J. S. Nicholas, February 9, 1954 


Evidence has been accumulating that the polarized-light sensitivities of various 
single ommatidia in the Limulus eye were different and that even the same omma- 
tidium might show altered responses under changed conditions. Since the orienta- 
tion of the optic axes of these units varies widely from one place to another in the 
eye, the most likely parameter concerned here seemed to be the incident angle of 
the stimulus. Consequently, experiments have been initiated to test the effect of 
this factor systematically. The general results so far obtained with nonpolarized 


Fig. 1.—Polarized-light sensitivity of a single element of the Limulus lateral 
eye to 1-second flashes of plane-polarized light having a constant intensity but 
different planes of polarization, as indicated for each exposure. Maximum 
stimulation occurred near 0° and 180°; minimum near 90°, where only half as 
many optic-nerve impulses resulted from the standard flash. 


light and a discussion of the significance of such directional sensitivity for visual 
acuity have already been presented.' This report describes the outcome of similar 
experiments with polarized light and considers the possible usefulness of such 
findings in determining the mechanism of polarized-light sensitivity. 

Material and Methods.—The experimental preparation, the optical system, and 
the recording technique were all essentially the same as those used previously.’ 
The only major innovation was the introduction of an adjustable Polaroid filter 
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between the depolarizer and the shutter of the optical system. This allowed system- 
atic changes to be made in the plane of polarization of the stimulating light 
without affecting either its direction or its intensity. It is worth noting that the 
presence of the depolarizer may, by eliminating all stray polarization of whatever 
origin, be a highly important factor in the effective conduct of such experiments. 

Results.—Whenever an electrophysiologically single element of the optic nerve 
showed marked sensitivity to the plane of polarization of the light illuminating the 
corresponding ommatidium, the following relationships were found. With con- 
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PLANE OF STIMULUS POLARIZATION 
Fic, 2.—Effeet of eve orientation on polarized-light sensitivity. Preliminary experiment in 
which it was observed that the high polarization sensitivity shown by a particular ommatidium in 
one orientation relative to stimulus incidence (curve A) was reduced to less than half when the eye 
was rotated (and hence the angle of incidence changed) through 30° (curve B). 


tinuous illumination, the frequency of optie-nerve impulses varied systematically 
with the plane of polarization passing through maxima and minima 90° apart.’ 
With flashes of light, provided that the unit was sufficiently sensitive to polarization, 
different numbers of action potentials resulted from exposures of the same intensity 
and duration but of different planes of polarization (Fig. 1). Again maxima and 
minima occurred 90° apart. Initially it was found that rotation of the eye into 
a new orientation relative to the stimulus could markedly affect the difference 
between these maximal and minimal points (Fig. 2). 
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Subsequently, the position of the optical axis of a single ommatidium was de- 
termined functionally by finding the most effective incident angle for the stimulus, 
This could be accomplished readily if only those ommatidia with optical axes 
normal to the corneal surface were used. Restriction of the excitable elements to 
the central region of the eye with opaque paint was an effective way of insuring this, 
as already described.!_ When the stimulus entered an ommatidium parallel to its 
optic axis and normal to the corneal surface, little or no polarized-light sensitivity 
was shown in the responses. This was true at least for flashes (Fig. 3); comparable 
measurements have not yet been made in continuous light because such long ex- 
posures are required to obtain adequate data. In contrast, units whose response 
to normally incident light was like that shown in Figure 3 gave evidence of an 

_ ability to analyze polarized light when the light stimulus had a large incident angle 
to the ommatidial axis (Fig. 4). 


Fig. 3.—Virtual absence of polarized-light sensitivity of a single ommatidium 
exposed to 1.0-second flashes of plane-polarized light whose angle of incidence 
coincided with the optical axis of the unit. Successive flashes were of the same 
intensity, but differed in their planes of polarization as indicated. In all planes 
the same number of impulses were evoked by the flash. The upper line in 
each record shows the optic axon response to the flash, and the lower line the 
initiation and duration of the stimulus on the same temporal abscissa. 


Discussion.—The discovery that the polarized-light sensitivity of the compound 
eye of at least one arthropod shows marked dependence on the angle of stimulus 
incidence may provide important clues to the mechanism of polarized-light percep- 
tion.? In addition, the evidence that little or no polarization analysis was done on 
light incident at an angle close to the optical axis indicates the kind of further data 
necessary for the solution of this problem, namely, a precise quantitative knowledge 
of the relation between angle of incidence and polarization sensitivity. Before 
this additional information is obtained, however, it is possible to consider in how 
far the present findings illuminate the question of mechanism. 

Perhaps the first possibility to be considered is that the polarization analyzer of 
arthropod eyes depends on simple reflection-refraction phenomena at the external 
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surface of the cornea. These would obviously be dependent on angle of incidence, 
as expressed quantitatively by Fresnel’s formulas. Furthermore, the differential 
reflectance of light polarized in various planes is shown by these formulas to be 
negligible at low angles of incidence but to be considerable at the polarization angle 
and other large angles. As far as it goes, then, the present qualitative data might 
be accounted for on such a model. However, the possibility of complicating effects 
from internal reflection and refraction within the dioptric elements of the ommatid- 
jum should not be overlooked. Nor should one forget the angular sensitivity of 
the other possible means of analysis, which will be mentioned below. 

Experimental data on the behavior of Drosophila, Daphnia, water mites, and 
mosquito larvae® have, in fact, already been interpreted to indicate that refraction- 
reflection phenomena in the eyes form the sole sensory basis of their polarized-light 
behavior patterns. However, more extensive evidence is desirable for such a con- 
clusion, particularly if it is to be considered of general applicability. 


Fic. 4.—Polarized-light sensitivity of the same ommatidium whose re- 
sponses are shown in Figure 3. In the present case the angle of stimulus inci- 
dence was 45° to the optical axis. As before, 1.0-second flashes of plane- 
polarized light were used as stimuli. These were of constant intensity but of 
different planes of polarization, as labeled. The lower line of the record again 
shows the initiation and duration of the stimulus on the same abscissa as the 
electrical activity of the optic axon responding. 


Data from other arthropod eyes have been found to support different polarized- 
light analyzing mechanisms. Autrum and Stumpf* concluded from analysis of elec- 
troretinograms in the honey bee and in Calliphora that individual retinular cells 
act as detectors of the plane of light polarization. Since the stimulating light 
apparently entered the ommatidia responding at angles close to their optical axes, 
these data and their interpretation are incompatible with the kind of reflection- 
refraction mechanism proposed by Baylor and Smith and by Stephens, Fingerman, 
and Brown. Nor could Autrum and Stumpf’s mechanism work in Limulus, since 
there each ommatidium ordinarily contributes but one active fiber to the optic 
nerve,® a restriction which apparently is not present in some insects.” 
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Optical indications that birefringence of the rhabdome in certain dipteran eyes 
is coupled with a mechanism for the extinction of one ray, somewhat as in a Nicol 
prism, were reported by Menzer and Stockhammer.’ Finally, it has been sug- 
gested,* 7 partly on the basis of an analogy with the mechanism of seeing Haidinger’s 
brushes by the human eye, that some dichroic element in the arthropod ommatidium 
might provide the required polarization analyzer. 

Note that the three physical methods suggested as means of polarization analysis— 
(1) reflection-refraction, and birefringence with one ray eliminated either (2) by 
reflection or (3) by absorption—are all sensitive to the direction of the light rays 
traversing the system. Consequently, the present data do not alone provide 
adequate grounds for choice among them. 

On the basis of the incomplete and conflicting information at present available on 
the mechanism of polarized-light analysis by arthropods, several points are clear. 
First, there may be various mechanisms operating in the eyes of different groups. 
Second, much more work needs to be done before any adequate general conclusions 
may be drawn. Finally, a quantitative investigation of the relation between 
polarization sensitivity and angle of light incidence is highly desirable in the case of 
Limulus at least. The precise shape of the function may indicate definitely the 
type of mechanism involved in this case. Also changes in the function induced by 
altering the refractive index of the external medium should show whether reflec- 
tion-refraction at the corneal surface is a critical factor or not. 

The main points of this report may be summarized as follows. The sensitivity of 
a Limulus ommatidium to polarized light varies with the angle of stimulus incidence. 
It is low or absent close to the optical axis and high at larger incident angles. Since 
the various suggested mechanisms for polarization analysis would all show some 
directional sensitivity, the present qualitative data do not permit the analytical 
device to be specified. Quantitative data of the same sort, however, might do so. 


* These studies were aided by a contract between the Office of Naval Research, Department of 
the Navy, and Yale University, NR 163-091. A preliminary account of this work was presented 
at the December, 1953, meeting of the American Society of Zodlogists (Anat. Rec., 117, 566, 1958). 
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